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Abstract 

The sequence ai ■ ■ ■ a m is a common subsequence in the set of per- 
mutations S — {tvi, . . . , 7Tfc} on [n] if it is a subsequence of 7i~i(l) ■ ■ ■ ^i(n) 
and 7Tj(l) • • ■ 7Tj(n) for some distinct Hi,Hj £ S. Recently, Beame and 
Huynh-Ngoc (2008) showed that when k > 3, every set of k permutations 
on [n] has a common subsequence of length at least n 1 ^ 3 . 

We show that, surprisingly, this lower bound is asymptotically optimal 
for all constant values of k. Specifically, we show that for any k > 3 and 
n > k 2 there exists a set of k permutations on [n] in which the longest 
common subsequence has length at most 32(fcn) 1//3 . The proof of the 
upper bound is constructive, and uses elementary algebraic techniques. 
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1 Introduction 



The sequence a\ • • ■ a rn is a common subsequence in the set S = {tti, . . . ,7Tfe} of 
permutations on [n] if it is a subsequence of 7Tj(l) • • • 7Tj(n) and 7Tj (1) • • • 7Tj (n) 
for some distinct ni,TTj G S. In this article, we study the minimum length of 
the longest common subsequence(s) in a set of k permutations on n. 

Definition 1. Let fk(n) denote the maximum value m for which every set of 
k permutations on [n] is guaranteed to contain a common subsequence of length 
m. 

The celebrated Erdos-Szekeres Theorem [5] states that every sequence of 
length n contains a monotone subsequence of length [n 1 / 2 ] . In our terminol- 
ogy, the theorem states that for every permutation tt on n, the set {ir, 1, l r } 
contains a common subsequence of length at least [ro 1 / 2 ] , where t is the identity 
permutation and c R is its reversal. 

As a consequence of the Erdos-Szckcrcs Theorem, sets of permutations that 
include a permutation and its reversal can not hope to show an upper bound 
stronger than fz{n) < [n 1 / 2 ] . The bound on fs(n) is in fact much smaller: as 
Beame and Huynh-Ngoc [5] recently showed, fs{n) = fi{n) = [n 1 / 3 ] . 

When k > 4, the exact values of the function fk(n) are not currently known. 
A simple probabilistic argument establishes an upper bound of 

f k {n) < 2eV^ (1) 

for every k < e ev/ ™, and a counting argument shows that for every k > 3, 

/*(»)> In 1 / 3 ]. (2) 

(Proofs of (p} and |(5J) arc included in the Appendix.) The goal of the research 
presented in this note was to determine the correct asymptotic behavior of fk (n) . 

1.1 Our results 

We present two results in this paper. The first result uses Hadamard matrices 
to show that fk(n) grows asymptotically slower than 1/n. 

Theorem 1. Let k > 4 be an integer such that a Hadamard matrix of order k 
exists. Then f k (n) < \n~~\ k / 2 ^ 1 . 

A slightly weaker form of Theorem[T]was mentioned in a preprint [3] of [2] but 
only the details for the case k = 4 were included. Beame and Huynh-Ngoc also 
conjectured in [3] that the bound in Theorem [1] is tight, up to a multiplicative 
constant, for every k power of 2. 

Our main result disproves the conjecture, showing that fk(n) grows at a rate 
proportional to n 1 / 3 for every constant k. 

Theorem 2. For 3 < k < n 1 / 2 , f k {n) < 32(kn) 1 / 3 . 

Combined with the lower bound in ([2]), Theorem [2] completely characterizes 
the behavior of fk{n) for every constant k, up to a multiplicative constant. 
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1.2 Motivation and other related work 

Streaming algorithms. The behavior of /fc(n) was first examined in [5] while 
studying the read/ write streaming computation model introduced by Grohe and 
Schweikardt In this model, an algorithm can store an unlimited amount 

of temporary data in multiple auxiliary streams, but tries to minimize both its 
memory size requirements and the total number of passes it makes on the data 
streams. 

In [5], lower bounds on fk(n) were shown to give complexity upper bounds 
on algorithms for the permuted promise set-disjointness problem, an important 
problem in the read/write stream model. In particular, the bound fs{n) > n 1 / 3 
was used to show the existence of an algorithm that requires only logarithmic 
memory and a constant number of passes when n < p 3 / 2 /64, where n is the 
size of the universal set and p is the number of input subsets. The conjecture 
in [5] that fk(n) > n 1 / 2 ^ ^ for some k that is would have improved the 
result to show that the same algorithm would also work for any n < p 2 ~°W, 
matching the lower bound for the problem. Our result, however, strongly refutes 
the conjecture. 

Error-correcting codes. A code over a metric space (M, d) is a set C of 
elements - called codewords - from M. The code C has distance <5 if for every 
two distinct codewords Ci,C2 G C, d(cx,C2) > S. Two central problems in the 
study of error-correcting codes involve determining the largest code with a given 
distance, and the dual problem of identifying the maximal distance of any code 
with |C| = k codewords. 

In the study of codes that correct deletion errors, the metric of interest is 
the deletion distance, where ddei(jr, a) is one half the number of deletions and 
insertions required to turn the sequence 7r(l) • • • 7r(n) to <r(l) • • • a(n). Our 
results on fk(n) have a direct implication for codes built over (<S„, ddei)'- a code 
of size k over this metric space has maximal distance n — fk(n). 

There has been extensive research on error-correcting codes built over a 
metric space defined by a deletion distance [TJ [TH Q31 US] , and on codes built 
over the symmetric group S n [H EE H] . ^ s ^ ar as we know, however, our result 
is the first to explicitly provide bounds on the capabilities of error-correcting 
codes built over (S n ,dd e i)- 

Combinatorics on sets of permutations. The study of /fe(n) falls into the 
area of combinatorics on sets of permutations, an area that extends beyond the 
field of error-correcting codes. In particular, we highlight the exciting recent 
result of Ellis, Friedgut, and Pilpel [7], who settled a conjecture of Frankl and 
Deza [S] by showing that for any set S of permutations on n in which the 
Hamming distance between every pair of distinct 7r, a G S is dnami^, & ) < n—m, 
the size of S must be at most (n — m)\. 
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2 Proof of Theorem [T] 



Recall that a Hadamard matrix H of order k is a k x fe ±l-matrix with the 
property that every two distinct rows in H differ in exactly k/2 entries. We use 
the rows of Hadamard matrices to construct k permutations that have no long 
common subsequence. 

Theorem [1] (Restated). Let k > 4 be an integer such that a Hadamard matrix 
of order k exists. Then 

h{n)<\n^] k/2 -\ 

Proof. Let s = [n 7 ^ 1 ] and n' — s k ^ 1 > n. We will show that fk{n') < s k / 2 ^ 1 . 

There is a natural bijection <f> from [n'\ into the (fc — l)-dimensional integer 
lattice [s] fe_1 given by 

(j>{x) = (0! (x), (j> k -l(x)) 

where 4>i(x) is the i-th digit of x — 1 in base s, with the left-most digit being 
the most significant. Note that under (f>, the standard ordering on [n'] induces 
the lexicographic ordering on the vectors in [s] fe_1 . 

The idea of the construction is to use the i-th row of the Hadamard matrix 
to define k — 1 permutations on [s]. The i-th permutation in the set is then 
chosen as the "outer product" of these k — 1 permutations. 

More precisely, let H be a k x k Hadamard matrix whose rows and columns 
are indexed by {0, . . . , k — 1} and whose first row and column entries (without 
loss of generality) are all 1. Define the permutation txh : [s] — > [s], depending 
on the last k — 1 columns of H, by 




where i is the identity permutation and l r is the reversal permutation. The 
permutation 7Tj is then given by 

7T 4 (x) = _1 (7r i4 (0i(x)), . . . ,7ri,fc_i(0fe_i (x))). 

Because converts the lexicographic order on [s] fe_1 to the standard order 
on [n'], the relative order of distinct elements x, y € [n'\ in Ki depends only on 
their relative order in n^t for the first (most-significant) coordinate I £ [k — 1] 
such that 4>i(x) ^ 4>i{y)] in particular, since the only choices for ■ni^,itj^ are i 
and l r , it follows that x and y have the same relative order in TTi and in ttj if 
and only if 7Tj ^ = ttj^. 

We reason by contradiction. Assume that there exist two permutations 
7Ti, TTj in {tti, . . . , 7Tfc} that have a common subsequence of length greater than 
s fc/2-i _ |" ri -fc^T"| fe / 2 1 Let Lij be the set of indices of columns among the last 
k — 1 columns of H in which the rows i and j have the same value in H. By 
assumption on H, we have \Lij\ = k/2 — 1. So, by the Pigeonhole Principle, 
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there must exist distinct x, y in the common subsequence of 7Tj and nj such that 
4>i(x) — 4>t{y) for every I S L^.j. But then 71^ ^ 71^ for the first index I such 
that 4>z(x) ^ <t>i{y): and so a; and y do not have the same relative order in 7T; 
and TTj . This contradicts the fact that x and y are in a common subsequence of 
TTi and 7Tj and completes the proof of the theorem. □ 

3 Main result 

Theorem [2] (Restated). For every 3 < k < n ' , 

fk(n) < 32(fcn) 1/3 - 

There are two main ingredients used in the construction that establishes 
the upper bound of fkip) in Theorem [2j a bijection <p n> k that maps the in- 
tegers l,...,n to a 3-dimensional integer lattice, and k triples of functions 
(51, i' 52,i: 53,i)' (ffi.fc) 92, fc,33 : fc) that are used to generate orderings of the 
elements of the 3-dimensional lattice. 

Let si — (n/fc 2 ) 1 / 3 and s 2 = S3 = (nfc) 1 / 3 . For simplicity, we first assume 
that Si, S2, and S3 are integers. The general case will be easily dealt with later. 
Since k < n 1 / 2 , we have si > 1 and S2 = S3 < n 1 / 2 . Let X be the 3-dimensional 
integer lattice [si] x [S2] x [S3], and let the bijection : [n] — ► X be the 
function whose inverse is given by 

( K]kk x ^y^ z ) = x + s i{v - !) + sis 2 {z - 1). 

This mapping associates the standard ordering on [n] with the lexicographic 
ordering on (x, y, z) tuples in X in which the x coordinate is least significant 
and the z coordinate is most significant. The reason for the smaller range of x 
coordinates relative to the other two will become apparent in the analysis. 

Let p be the smallest prime larger than 4S3. For j = 1, . . . , k, the functions 
gij : X — * Z, : X — > Z, and g^j : X — > Z are defined by 

33 j (x, y,z) = fx + 2jy + 2z mod p, 
92,j(x,y,z) = jx + y, and 
9i,j(x,y,z) = x. 

For every i G {1, 2, 3} and j € {1, . . . , A;}, define = o 0„ fc and set 
hj = (hi j, h2,j, hs,j)- Note that although the image of [n] under <p nt f. is the set 
X, the image of [n] under an hj is a set of triples not constrained to lie in X. 
We first see that each hj is 1-1 on [n]. 

Proposition 3. For any j € [k] and distinct a, b £ [n] we have hj(a) 7^ hj(b). 

Proof. Suppose for contradiction that there exist a, 6 € [n] such that hi t j(a) = 
hij(b), h2 t j(a) = h2,j(b), and h^j{a) = ha t j(b). Then, by definition, there exist 
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two distinct points (x a ,y a , z a ), (xb, Ub, Zb) € X such that 

fx a + 2jy a + 2z a = j 2 x b + 2jy b + 2z b (mod p), 
jXa+Va = jx b + y h , and 

Xa — Xb, 

which implies that x a = x bl y a = 2/6, and z a = z b (mod p). Since p > S3, we 
have contradiction. □ 

For j = 1, . . . , k, the function hj determines a total order <j on [n] as follows: 
For a, b e [n], write a <j 6 iff hj(a) is less than in the lexicographic order 

on integer triples in which the third coordinate is most significant and the first 
coordinate is the least significant. 

Let TTj be the permutation on [n] that orders the elements in [n] in increasing 
order as defined by < j . That is, let ttj be the permutation such that 

^(i) <j Tj(2) <j ■■■ <j Kj(n). 

As we show below, the set of permutations {tt\, . . . , 7Tfe} has no common subse- 
quence of length greater than 16(nfc) 1//3 . 

Lemma 4. For 3 < k < n 1 / 2 let m, . . . , n k be the k permutations on {1, ... ,n} 
defined above. Then {m, . . . ,iik} has no common subsequence of length greater 
than W(nk) 1 ^. 

Proof. Let a\ a-i ■ ■ ■ a s be a subsequence of m and nj, for some 1 < i < j < k. 
Then 

ai <i a<i <i ■ ■ ■ <i a s and 

0-1 <j 0,2 <j ••• <] O s . 

In particular, this implies that /i3,i(ai) < ^.3,1(02) < • • • < /i3,i(a s ) and h^^{a\) < 
hz,j{a2) < ■ ■ ■ < hsj(a s ). The functions h^^ and h^j can each take p different 
values, so any sequence of distinct pairs (/i3,i(ai), h3j(ai)), . . . , {fi3 t i(a s ), h3j(a s )) 
satisfying the increasing property can have at most s = 2p — 1 elements. 
Bcrtrand's Postulate guarantees that p < S(nfc) 1 / 3 , so to prove the claim it 
is sufficient to show that the pairs (/i3,i(a t ), ft.3j(a t )) for t e {1, . . . , s} must be 
distinct. 

We prove by contradiction that the pairs (/i3,i(a t ), h3j(a t )) must be distinct 
for i 7^ j. Assume that there exist two indices t ^ t' in {1, . . . , s} such that 

h3,i( a t) = h 3 ,i(a t ') and 
h>3,j(at) = h 3i j(a t >). 

Then, letting n ,fc(a t ) = (x t ,y t ,z t ) and (j> n ,k( a t') = i x t' ,Dt> , Zf), the above 
equivalences imply that 

i 2 (x t - av) + 2i(y t - y v ) + 2(z t - z t <) = (mod p) and 
j 2 (a; t - x t /) + 2j(y t - y t /) + 2(z t - z t ,) = (mod p). 



G 



Taking the difference of these equations, we observe that 

{i 2 - j 2 )(x t - x v ) + 2(i - j)(y t - y v ) = (mod p). 

Since k < n 1 / 2 we have S3 = (nfc) 1 / 3 > k and so p > Ak. Therefore < j — i < p 
and hence i — j ^ (mod p). Thus, 

(i + j){x t - x v ) + 2(y t - y v ) = (mod p). 

In fact, sincc0 

|(i +j)(xt -x t >) + 2{y t - y t ,)\ < 2k Sl + 2s 2 < 2k(n/k 2 )^ 3 + 2{nk)^ 3 < p, 

the only possible solution to the last equation is when 

(i + j)(x t -x t >)+2(y t -y t >)=0. (3) 

Observe now that if xt = Xf then from ([3]) we derive that yt — yv and, since 
h>3,i(at) = hi t i{att) and p > 4s3 we can conclude that this would imply that 
z t = z t > which violates our assumption that a t 7^ a t > . Therefore Xt x t > . 
Assume without loss of generality that Xt — x t > > 0. 

Using the fact that i < j, by replacing i and j in ([3]) we see that 

2i{x t -x t ') +2(y t - y t >) < =>■ ix t + y t < ix t > + y t > and 
2j(x t - x t ') + 2{y t - y t ,) > jx t + y t > jx t > + yv ■ 

By the lexicographic ordering on triples this means that a t <i a t > and av <j a*, 
so a t and av cannot be elements of a common subsequence of 7r^ and 7Tj, and 
we have arrived at the desired contradiction. □ 

Lemma|4]thus proves Theorem[2]for the case that si, S2, and S3 are integers. 
For the general case, let s[ — [(n/fc 2 ) 1 / 3 ] and let n' = (s'^k 2 < 8n. Then 
= (n'/fc 2 ) 1 / 3 and s' 2 = s' 3 — (n'fc) 1 / 3 = s[k are also integers. Repeating the 
same argument, we get 

fk(n) < /fc(n') < Win'k) 1 / 3 < 32(nfc) 1 / 3 . 

This proves Theorem [2] 
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A Probabilistic upper bound on fk(n) 



The Erdos-Szekeres Theorem stimulated a long line of research into the dis- 
tribution of the length of the longest increasing subsequence in a random per- 
mutation [16] . The results from this line of research yield a tidy probabilistic 
argument establishing the upper bound of fk (n) in |lj . 

Proposition 5. For any k < e e ^™, fk(n) < 2e v / n. 

Proof. Consider a set S formed by choosing k permutations uniformly at random 
from all permutations on [n\. As Frieze showed [101 Lemma 1], the distribution 
of the length L n of the longest increasing subsequence in a random permutation 
on [n] satisfies 

Pr [L n > 2e^/n\ < e~ 2e ^ . 

The length of the longest common subsequence in two random permutations 
7Ti and TTj follows the same distribution as the length of the longest increas- 
ing subsequence in the random permutation iti o nj 1 , so the probability that 
the set S contains a common subsequence of length at least 2e^/n is at most 
(a) e ~ 2e ^™ < 1- Therefore, there must exist a set S of k permutations with a 
longest common subsequence of length less than 2e^Jn. □ 

B Lower bound on fk(n) 

Beame and Huynh-Ngoc [2] showed that f^n) = rt 1 / 3 , which in turn implies 
that fk(n) > rt 1//3 for every k > 3. For completeness, we include the proof of 
that result here, along with a small improvement for larger values of k obtained 
with the Pigeonhole Principle. 

Proposition 6. Let k > 3 and let m = m(k) be the largest integer such that 
m! < k and m < n. Then 

fk{n) > max [n 1 ^ , rnj . 

Proof. We begin by showing that fk(n) > fz{ n ) ^ n 1 ^ 3 , using an extension of 
the counting argument in Seidenberg's proof [14] of the Erdos-Szekeres Theorem. 

Assume for contradiction that there is a set S — {iti, ^2,^3} of permutations 
on [n] for which every common subsequence has length strictly less than rt 1 / 3 . 
For every I = l,...,n, define (f)(1) = (4>i,2(£), ^1,3^), <fe,3(^)), where 4>ij(£) is 
the length of the longest common subsequence of TTi and 7Tj that begins with I. 
By assumption, 4>ij(i) < n 1 / 3 . Hence there are strictly fewer than n possible 
values of <j>(t), which means that there exist i ^ £' such that (f>(£) = <p(£'). But 
there must be two permutations 7Ti , Hj that order I and £' in the same way, say 
w.l.o.g. £ is ordered before £' . This implies that :(£) > (f>i,j(£') so (f>(£) ^ (/)(£'), 
a contradiction. 

The second part of the theorem follows easily from the Pigeonhole Principle: 
every set S — {iti, . . . , 7Tfc} must contain two permutations 7Tj and TTj that order 
the elements 1, . . . , m identically. This completes the proof. □ 
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